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CHARACTEKIZATIONS OF LINEAR SUFFICIENT STATISTICS 

by 

B. Charles Peters, Jr.,‘ Richard Rednor,* 
and Henry P. Decell, Jr.* 

We develop necessary and sufficient conditions that a surjective 
bounded linear operator T from a Banach space X to a Banach 
space Y be a sufficient statistic for a dominated family of 
probability measures defined on the Borcl sets of X . Wc Rive 
applications of these results that characterize linear sufficient 
statistics for families of the exponential type, including as 
special cases the Wishart and multivariate normal distributions. 
The latter result is used to establish precisely which procedures 
for samplinc from a normal population have the property that the 
sample mean is a sufficient statistic. 


'Author was partially supported by NASA/JSC Contract NAS-9-15000 
with the University of Houston during the preparation of this 
work . 


1. Introduction ; Lot T be a surjective measuroablo trans- 


formation from U»o measuroablo space* (X,A) to the measuroablo 

I 

6pac€» (Y,8) , and let V be a set of totally finite measures on 
A .* Following' Halmos and Sava(;e [2], we say that T is a 
sufficient statistic relative to V if for each E t A there 
exists a measureable function P(E|*) : (Y,B) R (the real numbers) 
such tiiat for each F c 6, m c P 

p(EnT’‘(F)) « f P(E|y)dpT"‘(y) . 

In another nonequivalent deflnitlo|i of a sufficient statistic given 
by Lehmann and Scheffe'(3), 8 is always taken to be 8^ , the 
largest o-ficld on Y consistent with the mcasureabi 1 ity of T 
Bahadur [1] discusses the relationship between these two definitions 
at length. 

In this paper our particular concern is tiiat of developing 
necessary a.jd sufficient conditions that a surjective bounded 
linear operator T from a Banach space X to a Banach space Y 
be a sufficient statistic, where A and 8 are the respective 
Bore] fields of X and Y . Our first theorem shows that under 
a very natural condition the aforementioned definitions of 
sufficiency are equivalent. Specifically, the condition is that 
ker T “ {x e X|Tx = 0} be complemented in X ; that is, for some 
closed subspace S of X , X =* ker T ® S . (For example, if X 
is a Hilbert space, take S » (ker T)"*“.) As a corollary we obtain 
a simple characterization of sufficient linear statistics for 


» 
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dominated sets of measures. In Theorem 2. we replace the condition 
that ker \T be complemented with copditions on .the density functions 
corresponding to a dominated set V , Finally, wo give applications 
ofthese results that characterize linear sufficient statistics 
for families of the exponential type, including as special cases 
the Wlshart and multivariate normal distributions. The latter 
result is used to establish precisely which procedures for sampling 
from a normal population have tl)e property that the sample mean is 
a sufficient statistic. This generalizes the classical result that 
the sample mean is sufficient for independent samples. The final 
result deals with the connection between linear sufficient statistics 
and the Gauss-Markov theorem. 

If W is a Banach space, 6(W) will denote the Borel field 
generated by the open sets of W . The totally finite measures 
defined on B(W) will be denoted by M(W) . We will write p<<v 
for the relation of absolute continuity and dp/dv for the equiva- 
lence class of Radon-Nikodym derivatives of w with respect to v . 
For the definitions of a dominated set of measures, equivalent sets 
of measures, and their connection with o-finite measures defined 
on B(W), we refer the reader to Halmos and Savage [2]. 

2. Principal Results : Our first theorem shows that if ker T is 
complemented in S then, the two definitions of sufficiency 
described in the introduction are equivalent. 

Theorem 1 : Let X and Y be Banach spaces, let A = B(X) and lot 

T be a surjective bounded linear operator from X to Y such that 
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ker T la comp Irnu riled iu X . Then 8^ = 8(Y) . 


• • 

Proof: Since T Ik Dorel mensurcable. it suffices to show that 

■ I 

6,pC^(Y) • L.ct S bo a closed subspacc of X such that 

* _ I • 

X - ker T © S . If F c 8^. , then T (F) c 8(X) and if T 

denotes the restriction of T to S , then 

T (F) - T (F)r^ S c 8(X) . It follows that T (F) t B(S) , and 

“ * * _ 1 
since T is a topological isomorphism, F - TT (F) c 8(V) < 

Henceforth, we will assume that X and Y are Banach spaces; 
A ■ 8(X) , 8 ■ 8(Y) and T:(X,A) ^ (Y',8) Is a surjective bounded 
linear operator. According to (2, Leiruna 7), for a dominated 
collection of measures V C M(X) a measure X , equivalent to 
V , can be defined by 

m 

MK) s I a n.(E) 
i-1 ‘ ^ 

where is a countable subset of V which is equivalent 

^ i-1 

to V and ? < « . Obviously, if V is homogeneous, wo 

i-1 ^ ^ 

can take X c V . Combining the results of Theorem 1 with those 
of Lemma 2 and Theorem 1 of [2], we have: 


Theorem 2 : If ker T is complemented in X , then T is sufficient 
for V if and only if for each p c P there exists a real valued 
function on Y such that K^o T c du/dX . 


Proof : By. Theorem 1 of [2], T is sufficient if and only if for 

each u c P there exists a real valued Borel measureablc function 
g^ on Y such that ByO T c dw/dX . Since ker T is complemented 
in X , 8(Y) = B.J, and each real valued function g^ such that 


- 3 - 


g^T is Borcl tncasurcablo on X must be Dorel mcasurcablc on Y . 

, In all that follows 6g(x;z) will denote the Gateaux 

differential of the function g at x in the direction of z . 

« 

Corollary 1 ; If ker T is complemented in X , then T is 
sufficient for V if and only if for each u c P there exists 
f^ c dp/dX such that x c X and y c ker T implies 6f^(x;y) ■ 0 . 

Proof : If T is sufficient, then for each p c P there exists 
g^:Y •* R such that f^ ■ BpO T c dp/dX . It follows immediately 
that 6f^(x;y) ■ 0 for each x c X, y c ker T . 

If f^ c dp/dX and 6f^(x;y) “ 0 for p c P, x t X, y c ker T 
then f^(x+y) » each x c X , y c ker T . For z c Y 

define Ky(z) “ where z ■ Tx . Then is well defined 

and f^ ■ KyO T • Hence, T is sufficient. 

The next theorem concerns a replacement of the complemented 
kernel condition whenever there is a continuous Radon-Nikodym 
derivative f^ t dp/dX for each p c P . 

Theorem 3 ; Let V C X be an open set such that X(X'vV) ■ 0 and 
let X(U) > 0 for each nonempty open subset U of V . Suppose 
X(B+y) “ 0 whenever li C V , X(B) ■ 0 and y c ker T . For 
each p c P , let f^ c dp/dX be continuous on V . Then T is 
suf f icient • if and only if whenever x, z c V and 

Tx » Tz . 

Proof: If T is a sufficient statistic, then there exists g.. t dp 

such that g (x) “ B (z) uhenever x, z e V, Tx = Tz . Let p c- P 
p u 

and y t ker T be fixed. Tlie set 


is an open subset of V contained in IH* (D-y ) > vvhere 
B - {x c V I f^(x) / g^(x) } . 

Since X(B) ■ 0 , it follows from the hypothesis that A(U) ■ 0 

and hence, U ■ P . Thus ^^(x) “ f^(x+y) whenever x, x+y c V . 

Conversely, suppose ^p(x) • w t P , x , 7. c V 

whenever Tx ■ Tz . The function g^:T(V) R defined by 

g^(Tx) ■ f^(x) for X c V is well defined on T(V) . Since 

f is continuous on V, f "RnT on V, and T is an open 

mapping, it follows that g^ is continuous on the open sot T(V) 

For y ^ T(V) define Kp(V) “ 0 . Then g^ is Borel measure- 

able on Y and f “ g qT . Thus T is sufficient for V . 

P p“ 

The proof of the following corollary is clear and will bo 
omitted . 


Corollary 2 ; If, in addition to the hypotheses of Theorem 4, the 
set V is convex, then T is sufficient for V if and only if 
4fp(x;y) • 0 for each peP,xeV,yc ker T . 

3. Exponential Families : Let X and Y bo Banach spaces, 

(H,<*1*>) a Hilbert space and v a o-finite measure on B(X) 
such that v(X''<V) ■ 0 for some nonempty open convex set V C X 
for which ’vCU) > 0 for each nonempty open set U C V . Let 
P “ • > c r be a family of probability measures having 

exponential densities f^(x) = c(y)h(x) exp <Q(Y)|t(x)>c dp^/dv 
where c(y) > 0, h(x) > 0 on V a.e.(v), t:X -* H is continuous 


- 5 - 


und Gateaux di f feront iublc on V , and Q:r H . 


Theorem A . Let T;X ■♦V be linear,* bounded, sUrJectlve and 
v(fa^y) ■ 0 whenever U c 8(X), B C V, v(B) ■ 0 and y c ker T . 

If • 0 c r , T is a sufficient statistic for the exponential 
family V if and only if <Q(y) - Q( 6 ) | ' t (x ; y ) > ■ 0 for each 

Y c r , X t X and y c ker T . 

Proof : Under the stated assumptions V is homogeneous and thus 

X may be taken to bo an arbitrary element, say p , of P . 
Applying Corollary 2, T is sufficient for P if and oi.ly if 
4 Rv o(x;y) ■ 6 for each Ycr,xtV yc ker T , where 

Y , P 

Kv “ c( Y ) exp { < Q(Y ) - Q( 0) I t(x)>) . 

c(0) 

This is equivalent to <Q(y) - Q(0)| 4t(x;y)>" 0 for each 
Ycr,xtV,ye ker T . 

AplJl icat ion s . Let S denote the symmetric n ^ n matrices, 
r /»^tho positive definite elements of S and P a family of 
Wishart probability measures with m ^ n degrees of freedom having 
densities 

» c(Y)|S| exp {- I tr (y"*S)} . 

Theorem 5 . If 0 c r and T:S range (T) is linear, then T 
is a sufficient statistic for the Wishart family P if and only 
if tr [(y~ -B” )K] “ 0 for each y t f and K c ker T . 

Proof. The preliminary conditions of Theorem <1. are satisfied with 

Y " Lebesgue measure on S and the obvious identifications of c(y) 
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and h(S) . Let H equal S with <A|H> i tr(AB) , t(S) - S 
aid Q(y)'“ -1 /2 . Observe that •6t(S;F) ■ F- and apply Theorem <i . 

Remark: Theorem 5. implies that there is a nontrivial linear 

sufficient statistic if and only if there exists a linear mani- 
fold W S such that y t W for eacli y t r . 

Wo will now apply these results to normal families of 
probability monsures. In Theorem G. we will state set theoretical, 
algebraic and geometrical conditions, each equivalent to the 
condition that T be a linear sufficient statistic lor a family 
P ■ • y c r of normal n-variate probability measures having 

densities, ’vlth respect to Lebesgue measure on R*’ , 

p^(x) - exp (-^ (x-n^)l 

We will assume tljat for some 8 c r , Oo 0 and n„ •* I . 

P P 

This requirement imposes no loss of generality since for any 
B c r tijere exists a non singular matrix M for which 

P 

MgflgM' ■ I and a change of coordinate system defined by the 
transformat i'^*' x -* M.(x-no) allows one to recover the sufficient 

P P 

statistic in tlie original coordinate system. 

^ n ic 

Theorem G . If T:R -* R is a linear transformation of rank k 
and V ■ ^^Y^ ’ Y e r is an arbitrary family of n-variato normal 
probability measures such that for some f c r , * 0 and 

fjg ■ I then the following conditions are c(|uivalent: 
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(1) T is sufficient for V - (P^) , y e r. 

(2) kcr'TC- Hj. (ker(H^-I) n ln^l'^1 

(3) For each y c r , 

(a) T^Tn^ - 

(b) T^T(n^-I) - n^-I 

where the notation ( * )^ denotes the generalized inverse of (*) • 

Proof: To see that (1) -► (2) observe that the preliminary 

conditions of Theorem 4. are satisfied with v ■ Lebesgue measure 
on X ■ r” . Make the obvious identifications for c(y) and 
h(x) . Lot denote the n « n real matrices and define 

Q:r -► H " R*' « , t : X -* H and <*|0 on H , respectively, 

by Q(y) - (-0^ /2, n^n^/2) , t(x) - (xx'.x, I) 

and <(Aj,Wj,Bp|(A 2 ,W 2 ,H 2 )> " tr(AJA 2 > ^ ♦ tr(B'B 2 > . 

Since Q, t and <* | *> satisfy the remaining hypotheses of 
Theorem 1. and it(x’,z) ■ (xz' + z x; z,6 ) for each x, z c r” , 
it follows that for each y c r ; 

ker T C {ycR":x'(n^*-I )y - - 0 . x t r") 

- ker (n"*-I ) n (n“* n^]*^ - ker(n^-I ) n ( 

To see that (2) ^ (3) note that T^T is the orthogonal 
projection on range (T') ■ (ker T)'*' . Since n^c(ker T)'*’ , 

(3a) holds. Furthermore, ker T^T ■ ker T c ker (n^-I) implies 
range (n^-I) c range (T*T) and hence that T^T (n^-I) ■ (n^-I) 
which is (3b) . 

In order to sec that (3) -* (1) recall the definition of 

Q(y) . t(x) and the fact that 6t(x;z) ■ (xz'+z xf z, 6 ) . 
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We need imiy show that x'(n^-I)y - o'y ■ 0 for. each y t T , 

X'c X and y c ker T . Using (3b) and symmetry together with 
(3a) it follows that 

x'(n^-l)y - n'y • x*(n^-I )T‘^(Ty ) - n^T”^(Ty) - 0 . 

Wc state the following corollary without proof. 

Corollary 3 . Under the hypotheses of Theorem 6. , there exists 
a k K n rank k sufficient statistic for ^ ‘ ^ 

and only if there o\ists a rank k orthogonal projection P on r" 
such that (a) Pn.^ “ and (h) P(n^-I) ■ ^y”^ each y c r 

Moreover, any k * n rank k matrix such that T^T ■ P is a 
sufficient statistic for {P } , y c '*• 

Y 

Corollary 4 . If r ■ {0, !,•••, m-1) , 'Iq" ® i ^o** ^ 

R 5 I Hj I h2 1 * • • I I Oj-I I fl2“M * * ' I then T is a linear 

sufficient statistic for the finite family ^*’y^ ' y c r of 
n-variate normal probability measures if and only if 
range (T') ■ range (B) . Moreover, k ■ rank B is the smallest 
Integer for which there exists a k « n sufficient statistic for 
(P^) , Y c r. 

Proof: The equivalent condition is an immediate consequence of 

Theorem 6. The minimality statement follows from the fact that 
if T is a p « n rank p sufficient statistic then T*TB ■ B , 
hence, T^TBB^ ■ BU^ . It follows that range (BD^) C range (T^T) 
and, since (BB^)B ■ B, BB* satisfies Theorem 6.(3) so that k ■ 


• • • 
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Examp 1«! 1 . Lot x^, *2'‘**’ *n ' 

N(m,o) >nirlableB «uch the Joint density of '‘j **2 ' * * * **n 

ill N(pC^.Qj^) where t' • (1,1, •••,!) . Lot (P^) , m t R 

* . 
be. the family of probability measures having dcnsltios 

and T / 6 a 1 * n matrix. 

« 

Observe tliat T is sufficient for (P ) , p c It if and 

1/2 ^ 
only if TQ^ is sufficient for the family of probability' 

' _l /? 

measures , p c R having densities ^n’^^ and, 

according to Theorem 6., that this is equivalent to the condition 
j/2 _l/2 X 

that ker T ii c ( 1 . This is equivalent to (' ■ «„Tn„ 

n n n nnn 

for some scalar a . A simple calculation shows that 

n 

■ n(Tn^(^) so that the statistic T is sufficient for (P ) , 

p c R if and only if T ■ ( (Tfl C ) C'O ]/n . In particular, 

n n n n 
* 111 

note that T - T i (t'o“ L )“ C'fl” 1** sufficient for (P. ), p c R and tli. 

'^n n '’n' 'n n u 

A 

T (Xj , • • • ,x^ )' is an unbiased estimate of p for each integer n. 

This generalizes the classical result that the sample mean is a 
sufficient statistic for p when the samples Xj,X 2 ,*** are 
Independcr t . 


Further note that if T i (the statistic T for the 

sample mean) is a sufficient statistic for . W c R 

for each integer n , the column sums (row sums) of are 

identically • A routine induction argument shows 

that, in the latter case, Cov “ constant for i, Jsl,2,***, 

i f J. 


Example 2 . Let y « Wy ♦ c , 
of rank n and c N (0,1). 


the minimum variance unbiased 
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where W is a fixed m « n matrix 

According to the Gauss-Markov theorem, 

* — 1 

linear estimate of y is y • (W'W) W'y 
- 10 - 
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I 


Lot T ■ (W'W)“ W' and observe t' ^ for y c , 

T'(TT')"«T Wy - Wy and, slnco T*(TT')" T-T*T, /rheor«>m (i. Innilh *. T »h f 

HUfficlont statistic for the sot of probability mousuroH (P ) . 

y 

r t having densities N(Wy,I) . 

« 

On tho other hand, if T is a sufficient linear statistic 

for (P ) , Y c r'' such that Ty is an unbiased estimate of y 

* * 

then, since TW ■ I , T has rank n . Corollary 4 ii.tplies that 

n is tho smallest integer for which there exists a linear n » m 

n * - * 

sufficient statistic for . Y t R • Moreover, T ■ B(W'W) W' 

for some nonsingular n « n matrix B . Since TW ■ I , 

• l 

T • (W'W) W' . 

» 

Since y ■ Ty , the Gauss-Markov estimate of y may bo 
characterized '.he unique linear sufficient statistic T for 
(P^) , Y c r” for which Ty is an unbiased estimate of y • 
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